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macroststes  are  first  obtained  for  the  complete  graph  and  then  extended  to  the  case  of  uncorrelated  networks  of 
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Exact  Solution  of  the  Markov  Propagator  for  the  Voter  Model  on  the  Complete  Graph 


William  Pickering  and  Chjan  Lim 
Rensselaer  Polytechnic  Institute 
(Dated:  July  1,  2014) 

The  voter  model  is  a  simple  and  well  studied  model  in  network  science  in  which  nodes  in  a  graph 
have  one  of  two  states.  We  assume  throughout  the  discussion  that  every  node  is  connected  to  all 
other  nodes.  By  restricting  the  network  topology  in  such  a  way,  we  are  able  to  find  a  very  detailed 
solution  of  the  model.  A  single  step  propagator,  as  well  as  the  solution  to  its  spectral  problem, 
will  both  be  derived  exactly  by  means  of  generating  function  techniques.  The  eigenvalue  problem 
will  then  be  cast  in  the  thermodynamic  limit  as  the  number  of  nodes,  N,  becomes  infinite,  and 
it  is  found  that  the  eigenvectors  will  approach  polynomials.  Also,  using  the  exact  solution  to  the 
spectral  problem,  we  can  find  the  time  m  probability  distribution.  We  then  use  this  to  find  two 
quantities.  This  first  is  the  expected  time  until  the  system  reaches  total  agreement  and  the  second 
is  the  expected  frequency  that  each  macrostate  is  achieved  before  consensus  is  attained.  Other 
quantities  of  interest  will  follow  in  closed  form  given  the  solutions  provided  here. 


I.  INTRODUCTION 

Quantitative  insight  into  social  opinion  dynamics  can 
be  found  from  a  modeling  perspective.  One  of  the  most 
well  studied  of  which  is  the  voter  model  [1,  2].  This 
model  has  been  solved  analytically  for  several  quantities 
[3-7].  It  also  has  applications  in  many  areas  of  physi¬ 
cal  science,  such  as  particle  interactions  and  kinetics  of 
catalytic  reactions  [6] .  Social  opinion  dynamics  is  a  typ¬ 
ical  view  of  this  particle  interaction  model,  where  there 
are  two  opposing  opinions  in  a  political  discussion.  The 
fundamental  social  assumption  about  the  model  is  that 
individuals  are  strongly  influenced  by  the  beliefs  of  their 
neighbors.  Another  interpretation  of  the  voter  model  is 
that  species  compete  for  territories,  where  a  change  in 
state  corresponds  to  and  invasion  [2,  8].  Extensions  and 
generalizations  of  this  model  also  have  been  studied  in 
detail  [9-14]. 

A  specific  and  direct  application  of  the  voter  model  is 
to  population  genetics.  In  the  context  of  a  binary  coded 
genetic  algorithm,  the  voter  model  on  a  complete  graph 
can  be  interpreted  as  the  crossover  operator  of  bit  strings 
over  a  single  bit.  This  is  the  exchange  of  bits  once  two 
parent  strings  have  been  selected  [15].  The  voter  model 
is  an  instance  of  the  genetic  algorithm  that  examines  the 
case  where  selection  is  uniform  (constant  fitness),  only 
one  child  string  is  generated  per  crossover,  and  there  are 
no  mutations.  A  biased  voter  model,  which  introduces 
a  fitness  value  for  each  state,  also  has  been  studied  [9]. 
The  genetic  algorithm,  with  or  without  mutations,  is  a 
subject  of  interest  from  an  analytical  perspective  [15-17]. 
The  work  presented  here  will  provide  some  additional 
analytical  insight  into  unbiased  genetic  drift. 

Many  mathematical  models  for  opinion  dynamics  can 
be  viewed  as  discrete  time  Markov  chains.  It  is  known 
that  the  eigenvalues  and  eigenvectors  of  the  transition 
matrix  of  the  Markov  chain  have  a  vital  role  in  the  dy¬ 
namics  of  the  model.  In  general,  the  second  largest  eigen¬ 
value  is  related  to  an  approximate  rate  of  convergence  to 
equilibrium  [18].  For  the  voter  model,  this  is  when  all 
nodes  have  the  same  state.  All  dynamics  of  the  model 


halt  entirely  once  this  network  state  is  attained.  On  a 
connected  graph,  the  expected  time  to  attain  this  unan¬ 
imous  state  can  be  bounded  in  terms  of  the  spectral  gap 
of  the  transition  matrix  [19].  If  one  had  access  to  the 
complete  spectral  decomposition  of  the  transition  matrix, 
then  any  future  probability  distribution  can  be  computed 
in  closed  form.  In  this  paper,  we  offer  a  method  for  find¬ 
ing  the  closed  form  solution  of  the  spectral  problem  that 
yields  such  a  decomposition  for  an  instance  of  the  voter 
model. 

The  voter  model  can  be  cast  as  an  urn  problem  simi¬ 
lar  to  the  Ehrenfest  model  [20].  In  this  model,  there  are 
two  urns  with  N  balls  divided  amongst  them.  In  a  single 
time  step,  one  ball  is  chosen  at  random  and  is  transferred 
to  the  other  urn.  This  process  is  repeated  ad  infinitum. 
The  voter  model  on  the  complete  graph  has  a  similar  in¬ 
terpretation.  Here,  two  balls  are  selected,  one  after  the 
other.  After  selection,  both  of  the  balls  are  placed  in  the 
urn  from  which  the  second  ball  came.  When  cast  as  an 
urn  problem,  the  network  topology  is  identical  to  a  com¬ 
plete  graph.  A  closed  form  diagonalization  of  the  Markov 
transition  matrix  for  the  Ehrenfest  model  was  found  by 
Mark  Kac  in  1947  using  similar  techniques  we  will  utilize 
here  [21].  We  will  propose  a  generalization  of  those  tech¬ 
niques  which  allows  us  to  solve  the  spectral  problem  for 
the  voter  model.  This  particular  solution  of  the  model 
will  give  exact  analytical  expressions  for  several  quan¬ 
tities  of  interest.  The  two  specific  applications  of  the 
solution  to  the  spectral  problem  that  we  will  provide  are 
the  expected  time  to  consensus,  and  the  expected  time 
each  macrostate  is  visited  before  reaching  consensus. 

An  outline  of  the  paper  is  as  follows.  Section  II  will  de¬ 
scribe  more  of  the  details  of  the  model  and  the  associated 
random  walk.  In  section  III,  we  shall  derive  a  straightfor¬ 
ward  procedure  to  exactly  solve  the  eigenvalue  problem 
of  the  Markov  transition  matrix  for  the  macrostates  of 
the  system.  With  such  a  solution,  one  can  compute  the 
probability  distribution  at  any  future  time  in  closed  form. 
In  section  IV,  we  use  this  solution  to  find  two  quantities: 
the  expected  time  before  reaching  consensus  and  the  ex¬ 
pected  time  spent  at  each  macrostate.  These  expressions 
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are  functions  of  the  initial  probability  distribution,  which 
is  kept  arbitrary.  Although  an  asymptotic  treatment  of 
consensus  times  is  known  [9,  22],  we  will  improve  upon 
those  results  significantly  by  providing  an  exact  value  for 
the  expected  time  to  consensus  for  any  initial  probability 
distribution. 


II.  THE  2-STATE  VOTER  MODEL 

Although  the  model  that  we  consider  is  imposed  upon 
a  complete  graph  of  N  nodes,  more  general  graphs  are 
studied  as  well  [3,  9,  23,  24].  Each  node  is  assigned  one 
of  two  states,  A  or  B.  In  a  single  time  step,  a  node 
is  chosen  randomly  and  will  assume  the  state  of  one  of 
its  neighbors,  also  chosen  randomly  [2].  Note  that  in 
this  procedure,  it  is  possible  that  the  network  state  may 
remain  unchanged  for  several  time  steps. 

Let  nA{m)  and  nB{ni)  represent  the  total  number  of 
agents  taking  opinion  A  and  B  respectively  at  time  m. 
Since  the  total  number  of  nodes  must  be  conserved  and 
all  nodes  must  take  one  of  these  two  opinions,  it  is  nec¬ 
essary  that  n^(m)  -f  nsim)  =  N .  Since  V  is  a  constant, 
this  will  allow  us  to  simplify  the  model  to  a  random  walk 
in  a  single  variable,  say  ha- 

Now  let  us  formalize  the  problem  as  a  random  walk  in 
UA-  We  can  write  it  as 

nA{m  +  1)  =  nA{rn)  + /^nA{rn).  (1) 

For  a  given  time  step  m,  ua  is  considered  to  be  a  known 
constant,  and  the  random  behavior  is  exhibited  in  Aua- 
Since  only  a  single  node  is  updated  per  time  step,  Aua 
only  takes  values  from  {—1,0,1}.  Let  pj  =  ■ 

Then,  from  the  definition  of  the  model,  the  probabilities 
of  taking  these  values  are: 

Pr{AnA{m)  =  l\nA{m)  =  j)  =  pj  (2) 

Pr{AnA{m)  =  -l\nA{m)  =  j)  =  pj  (3) 

Pr{AnA{m)  =  0\nAim)  =  j)  =  1  -  2pj  (4) 

Thus,  the  random  walk  is  determined  given  an  initial 
condition  nA(0)  =  n. 

A.  Markov  Propagator  By  Generating  Fnnctions 

In  this  section,  we  will  outline  a  general  procedure  that 
provides  a  recurrence  relation  for  the  probability  distri¬ 
bution  of  general  random  walks.  The  strategy  is  to  con¬ 
struct  a  sequence  of  generating  functions  for  the  proba¬ 
bility  distribution  of  the  random  walk.  This  process  can 
also  be  applied  to  other  models,  incomplete  graphs,  or 
to  multiple  dimensions.  An  advantage  of  this  approach 
is  that  it  is  highly  generalizable  in  these  ways  and  only 


requires  minor  modihcation  to  do  so.  The  procedure  can 
be  applied  to  find  the  probability  distribution  of  either 
the  microstates  or  the  macrostates  of  the  model.  Here, 
the  macrostate  approach  will  be  utilized  since  the  model 
is  imposed  on  a  complete  graph. 

To  begin,  represent  the  probability  distribution  in 
generating  function  form.  Let  =  Pr(jiAim)  = 

j).  We  introduce  a  sequence  of  generating  functions 
and  seek  to  find  a  relationship  be¬ 
tween  and  R^'^\x).  From  the  random  walk 

form  of  the  model,  the  probability  generating  function  for 
An^  is  Dj{x)  =  pjX  -b  (1  —  2pj)  +  pjX~^ .  We  will  make 
use  of  the  following  properties  of  generating  functions  in 
the  derivations  to  follow: 

Product  rule:  If  X  and  Y  are  integer  random  variables 
with  probability  generating  functions  F{x)  and  G{x)  re¬ 
spectively,  then  the  generating  function  oi  X  +  Y  is 
F{x)G{x). 

Sum  rule:  If  the  probability  space  is  partitioned  into 
N  events,  each  with  generating  function  Fj{x),  then  the 
generating  function  for  the  entire  space  is  Fj{x) 

[25-27]. 

For  time  step  to,  suppose  that  nA(jn)  =  j.  Note  that 
xi  ig  corresponding  generating  function  for  this 
event.  Now,  utilize  the  product  rule  in  equation  (1)  to 
deduce  that  the  probability  generating  function  for  time 
TO  -b  1  is  x^ D j{x)  in  the  event  that  nA(jn)  =  j.  By 

the  sum  rule,  we  have  that 

N 

R(m+i)^x)  =  (5) 

3=0 

This  is  the  generating  function  form  of  the  Markov  prop¬ 
agator  of  the  random  walk.  This  can  be  easily  generalized 
to  other  models  simply  by  specifying  the  appropriate  ex¬ 
pression  for  Dj{x)  in  equation  (5).  For  the  given  network 
topology,  the  sum  is  simple  enough  to  collect  terms  and 
obtain  an  explicit  equation: 

=  Pj-io-^l  -b  (1  -  +  pj+ia^'^l-  (6) 

While  the  procedure  to  find  a  relationship  between 

Rim+l)  ^(m) 

can  be  highly  generalizable,  we  pose 
a  new  power  series  that  will  be  utilized  directly  to 
solve  the  formulation  in  equation  (6).  Let  Q'^'^\x,y)  = 
Some  properties  of  generating  functions 
of  this  type  are  listed  below: 

1.  Multiply  by  x/y  to  shift  a'f'’ 

2.  Multiply  by  y/x  to  shift 

3.  The  generating  function  for  Pjof^'^  is  ■ 

Using  these  properties,  we  rewrite  equation  (6)  as 

(x-yfQ^:;;^  =NiN-l)A+^Q(^\ 


(7) 
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Here,  A+m  is  the  forward  difference  operator  in  the  dis¬ 
crete  variable  m.  The  goal  of  the  subsequent  section  is  to 
solve  this  equation  explicitly  for  m  to  find  the  probability 
distribution  for  arbitrary  time  steps.  To  do  so,  we  turn 
to  the  spectral  problem. 

III.  THE  SPECTRAL  PROBLEM 


Recall  that  we  had  required  cj  =  0  for  j  <  0  and  j  >  N. 
This  is  also  true  for  bj  since  we  applied  a  linear  transfor¬ 
mation.  Therefore,  this  difference  equation  would  sug¬ 
gest  that  every  bj  =  0  unless  it  has  singular  behavior 
for  some  value  of  j  €  {0...N},  say  when  j  =  k.  At  this 
singular  point,  set  the  coefficient  of  bk  to  0.  This  allows 
the  value  of  bk  to  be  non-zero,  and  thus  not  all  bj  =  0. 
Therefore,  the  set  of  eigenvalues  is  determined  to  be 


It  is  clear  from  equation  (6)  that  the  future  prob¬ 
ability  distribution  can  be  expressed  as  a  tridiagonal 
transition  matrix  multiplied  by  the  probability  vector 
a(m)  =  [aj^^]^^i.  Solving  the  spectral  problem  provides 
a  basis  for  the  initial  distribution,  which  allows  future 
time  steps  to  be  computed  in  closed  form.  We  find  the 
solution  to  this  problem  here. 


A.  Matrix  Eigenvalue  Problem 

For  eigenvalue  A  with  eigenvector  v  =  we  have 

that  the  spectral  problem  for  the  propagator  can  be  writ¬ 
ten  as 

Pj-iCj-i  -b  (1  -  2pj)cj  +  Pj+iCj+i  =  Xcj.  (8) 

The  solution  to  the  problem  begins  by  defining  a  multi¬ 
variate  generating  function  for  Cj  as 


Now  we  can  find  the  eigenvectors.  Since  the  equation 
for  bj  has  singular  behavior  at  bk,  the  value  at  this  point 
is  arbitrary.  This  is  expected,  since  any  multiple  of  an 
eigenvector  remains  an  eigenvector.  Without  any  loss, 
let  =  1.  Now  we  can  ascertain  the  explicit  solution  for 
bj  when  k  <  j: 


bj 


n 


{i-l)iN-i  +  l) 
N{N-l){Xk-l)  +  i{i-l)' 


(14) 


When  j  <  k,  we  have  bj  =0.  Thus  all  values  of  bj  are 
determined  for  a  given  eigenvalue  Xk-  Note  that  in  (14), 
the  requirement  that  bj  =  0  is  satisfied  when  j  >  N . 

Now,  we  use  bj  to  find  the  components  of  the  eigen¬ 
vector,  Cj.  To  do  this,  express  H {u,  y)  in  the  original  x,  y 
variables: 


G{x,y)  =  Y^CjX^y^  (9) 

i 

We  seek  a  differential  equation  for  the  eigenvalue  prob¬ 
lem.  This  procedure  is  similar  to  Kac’s  solution  of  the 
Ehrenfest  model  [21].  Here,  however,  we  will  obtain  a 
PDE  instead  of  an  ODE.  Using  the  formulation  in  sec¬ 
tion  H  gives  the  following  partial  differential  equation  for 
G: 

ix-yfG,y  =  N{N-l)iX-l)G.  (10) 

Note  that  this  equation  has  no  boundary  or  initial  condi¬ 
tions.  To  find  the  appropriate  solutions  to  this  equation, 
we  assume  that  it  must  take  the  form  specified  in  equa¬ 
tion  (9),  not  all  coefficients  Cj  =  0,  and  that  Cj  =  0  for 
j  <  0  and  j  >  N. 

To  solve  this,  use  the  change  of  variables  u  =  x  —  y  and 
H(u,y)  =  G(x,y)  to  transform  the  partial  differential 
equation  into 

u^{Huy-Huu)=N{N -1){X-1)H.  (11) 

Since  this  is  a  linear  change  of  variables,  we  still  expect 
solutions  to  be  of  the  form  i7(M,  y)  =  bjU^y^~K  Sub¬ 
stituting  this  into  the  new  partial  differential  equation  for 
H  and  collecting  like  terms  will  give  the  difference  equa¬ 
tion  for  bji 

(j  -  l)(iV  -  J  +  1)6, _i  =  (jij  -  1)  +  NiN  -  1)(A  -  1))6,. 

(12) 


G{x,y)  =  H{u,y)  (15) 

=  Y^b.{x-yyy^-^  (16) 

i 

N  I 

=  (17) 

i^o  j^O  ^ 

N  N  z  .s 

Therefore  c,-  =  (“!)*”! lli®  spectral 

problem  is  solved  in  closed  form. 

B.  Differential  Eigenvalue  Problem 

In  this  section,  we  will  examine  the  eigenvectors  in 
more  detail.  In  particular,  we  wish  to  consider  the  ther¬ 
modynamic  limit  of  the  model  as  fV  — >■  oo  and  study  the 
behavior  of  the  eigenvectors.  To  do  this,  note  that  the 
spectral  problem  can  be  posed  as: 

A2(p,c,)  =  (Afe-l)c,  (19) 

Where  A^  is  the  second  centered  difference  operator.  Let 
Xj  =  j/N,  Ax  =  1/A  and  u{xj)  =  Cj.  Then, 

A^j{pju{xj)) 

Ax^ 


A^(Afc  -  l)u{xj). 


(20) 
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7th  Eigenvector  and  the  Hypergeometric  Limit,  N=100 


FIG.  1.  7th  Eigenvector  of  the  discrete  problem  plotted  with 
the  exact  solution  for  the  limit  as  N  ^  oo.  The  hypergeo¬ 
metric  function  in  the  figure  is  a  hfth  degree  polynomial. 


Thus,  take  N  ^  oo  to  obtain  the  differential  equation 
for  the  eigenfunctions  of  the  continuous  propagator: 

<P 

—  x)u{x)]  = —k{k  —  l)u{x).  (21) 

Expanding  all  derivatives  yields 


IV.  APPLICATIONS  OF  THE  SPECTRAL 
SOLUTION 

The  solution  of  the  spectral  problem  yields  exact  ex¬ 
pressions  for  the  probability  distribution  at  any  future 
time  step.  With  such  a  strong  result,  quantities  that  de¬ 
pend  on  macrostate  probabilities  naturally  follow  from  it. 
In  this  section,  we  will  apply  the  solution  of  the  spectral 
problem  to  find  two  quantities.  The  first  is  the  expected 
time  for  the  system  to  reach  consensus  and  the  second  is 
the  expected  time  spent  at  each  macrostate  before  con¬ 
sensus.  Both  quantities  are  considered  functions  of  the 
initial  distribution  of  macrostates. 


A.  Consensus  Time 

A  topic  of  particular  importance  with  social  models 
such  as  this  is  the  expected  time  until  all  agents  in 
the  network  have  the  same  state.  Once  such  a  state  is 
achieved,  the  dynamics  halt  entirely.  In  this  section,  we 
will  provide  an  exact  expression  for  the  consensus  time. 
This  calculation  depends  on  the  initial  distribution  of 
the  model,  however  an  estimate  that  is  independent  of 
initial  data  can  also  be  determined.  We  know  that  if 
Qm  is  the  probability  of  reaching  consensus  at  step  to, 
then  the  consensus  time  is  Qrn'm.  We  can  de¬ 

termine  qm  from  the  boundary  conditions  of  the  tran¬ 
sition  equation.  The  probability  of  reaching  consensus 
at  time  step  to  -I-  I  is  the  probability  that  the  model  is 
in  consensus  at  time  to  -I-  I  minus  the  probability  it  was 
already  in  consensus  at  time  to.  Symbolically,  this  is 
Qm+i  =  —  ag™^  —  Now  from  the 

boundary  conditions,  we  have  that 


x{l 


,  <Pu  .  sdu  ,,  ,,  , , 

x)— ^  +  (2  -  4a;)—  -I-  {k{k  -  I)  -  2)u 
aa;^  ax 


0,  (22) 


{m+l) 

'0, 


—  a 


(m)  _ 
0,1V  ~ 


N 


Am) 

'■l.AT-l- 


(24) 


which  is  valid  on  0  <  x  <  I.  This  is  a  form  of  the  hy¬ 
pergeometric  differential  equation,  and  as  such,  the  basis 
for  the  solutions  are 

Uk{x)  =  2Fi{k  +  1,2  —  k,  2,  x)  (23) 

for  k  =  0, 1,2...  This  is  a  special  case  of  the  hypergeo¬ 
metric  function  in  which  the  series  expression  terminates 
for  each  k  [28].  This  implies  that  the  kth  eigenvector  of 
the  voter  model  approaches  a  polynomial  of  degree  k  —  2 
as  N  ^  oo.  When  fc  =  0  and  k  =  1,  the  solution  is 
u{x)  =  0.  This  differential  equation  does  not  describe 
the  behavior  of  the  eigenvectors  at  the  boundary,  which 
explains  why  the  first  two  eigenfunctions  are  trivial.  In 
figure  I,  the  seventh  eigenvector  for  the  discrete  model 
when  V  =  100  has  very  close  agreement  with  the  contin¬ 
uous  solution  posed  in  (23). 


Thus  qm  =  (^1™  +  a^^_^^)/N .  If  takes  com¬ 

ponents  o!'^\  and  letting  Vk  be  the  kth  eigenvector, 
then  we  can  express  any  such  distribution  as  = 

J2k=o  dk2^]^'Vk-  Here,  dk  is  the  initial  probability  dis¬ 
tribution  expressed  in  the  eigenbasis.  We  only  need  com¬ 
ponents  2  and  N  —  1  from  to  find  the  consensus  time,  so 
we  let  Sk  =  (ifc[vk]2  +  [vk]Af-i  and  write 

N 

a["^^+a^;rl,  =  J2skXT-^.  (25) 

k^2 

We  exclude  the  k  =  0  and  k  =  1  terms  since  and 
are  independent  of  those  eigenvectors.  Letting  V 
be  the  matrix  of  eigenvectors,  we  will  write  the  expected 
time  to  consensus  as 
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E[T\a^°'>  =  Vd] 


m—1  k—2 


N 


N 

E 

k=2 


1 


1 


:Sk 


N  "(1-Afc)2 


N 


Sk 


k=2 


(fc(fc-  1))2- 


(26) 

(27) 

(28) 


This  is  an  exact  formula  given  the  initial  distribution. 
Now  let  us  estimate  this  quantity  to  ascertain  asymptotic 
information  of  consensus  times.  To  do  this,  we  need  to 
find  a  restriction  on  Sfc.  Consider  the  limit  of  the  system 
as  m  — >  oo.  Since  the  system  will  always  reach  consensus 
in  finite  time,  we  have  that  09°°^  +  =  1-  Let  us  now 

estimate  the  right  hand  side  of  (25)  by  using  the  largest 
available  eigenvalue,  A2,  to  obtain  that  for  some  s. 


<sA™-i.  (29) 

If  we  sum  both  boundaries  in  (24)  over  all  m,  the  left 
hand  side  is  telescopic.  If  we  assume  that  the  initial 
probability  of  consensus  is  not  0(1),  then  adding  the  two 
boundaries  together  gives  that 

(^)s  =  0(l)  (30) 

s  =  0{l/N)  (31) 

With  this,  we  obtain  the  asymptotic  behavior  of  the 
expected  time  to  consensus  as 


=  Vd]  <  O  £  A™-V 

^  ^  m—1 


=  0{N^) 


1 

(1-A2)2 


(32) 

(33) 

(34) 


Thus,  as  a  uniform  estimate  for  general  initial  prob¬ 
ability  distributions  of  the  model,  the  asymptotic  be¬ 
havior  of  the  expected  time  to  consensus  is  quadratic, 
which  is  consistent  with  previous  results  [9,  22,  23]. 
Using  continuous  time  methods,  the  expected  time  to 
reach  consensus  given  a  opinion  density  p  is  E[T]  = 

iV^  [(I  —  p)  In  -f  pin  ij  [9].  The  exact  solution  and 


the  bound  we  found  here  are  improvements  upon  this  re¬ 
sult  since  they  are  valid  for  small  values  of  N  and  general¬ 
ize  the  theory  to  general  initial  probability  distributions. 


B.  Macrostate  Times 

We  define  macrostate  times  as  the  expected  frequency 
of  each  state  of  the  random  walk  prior  to  consensus.  Un¬ 
like  the  consensus  time,  macrostate  times  will  be  orga¬ 
nized  as  a  vector  whose  components  correspond  to  each 


state.  The  consensus  states  are  not  included  because 
they  have  an  infinite  macrostate  time.  For  non-consensus 
states,  we  expect  macrostate  times  to  be  finite  since 
the  consensus  time  is  also  finite.  Macrostate  times  are 
stronger  quantities  than  the  consensus  time  since  the  sum 
of  all  macrostate  times  is  identical  to  the  consensus  time. 


1.  Discrete  Time  Solution 

Since  the  solution  of  the  spectral  problem  is  known, 
computing  macrostate  times  becomes  straightforward. 
Let  Mjim)  be  the  total  number  of  visitations  of 
macrostate  j  by  time  m  and  let  AMj(m)  =  Mj{m)  — 
Note  that  Mj{m)  depends  on  the  outcome  of 
the  random  walk  for  ua-  Since  is  the  defined  as  the 
probability  that  nA(rn)  =  j,  we  have  that  AMj{m)  takes 
value  1  with  probability  Otherwise,  it  takes  value 

0.  Thus,  for  j  =  1...N  —  I,  we  can  write  the  macrostate 
time  as 


E[Mj]  =  E[AMM)]  =  E  (35) 

m—0  m—0 

We  can  use  the  solution  to  the  spectral  problem  to 
compute  this  infinite  series  exactly.  Let  M  be  a  vector 
whose  components  are  E[Mj].  Also,  letting  dk  be  the 
initial  distribution  of  the  macrostates  expressed  in  the 
eigenbasis,  we  have  that  the  time  m  probability  distri¬ 
bution  is  The  fc  =  0  and  k  =  1 

terms  in  the  sum  are  the  contributions  of  the  consensus 
states  to  the  probability  distribution.  Since  consensus  is 
a  frozen  state,  the  probability  distribution  for  the  rele¬ 
vant  macrostates  are  independent  of  the  first  two  terms 
in  this  sum.  The  macrostate  times  can  be  found  once 
these  terms  are  discarded: 

00  N 

U[M]  =  EE  dfeVfcA^  (36) 

m—0  k—2 

N  , 

The  first  and  last  components  of  this  vector  are  irrel¬ 
evant  since  it  is  understood  that  Mq  =  Mm  =  00.  The 
remaining  components  of  M  are  the  exact  values  for  the 
macrostate  times. 

We  perform  Monte  Carlo  simulation  to  reinforce  this 
result.  Three  cases  of  the  initial  distribution  are  consid¬ 
ered:  =  5j^M/2,  af^  =  <5i,iv/4,  and  Vj  = 

0...N.  Figure  2  shows  that  there  is  good  agreement  be¬ 
tween  the  exact  solutions  and  the  results  from  the  simu¬ 
lations.  A  particularly  interesting  case  is  when  the  initial 
distribution  is  uniform.  This  distribution  happens  to  be 
the  eigenvector  corresponding  to  the  second  largest  eigen¬ 
value  of  the  transition  matrix.  So,  in  equation  (37),  we 
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Initial  Distribution:  HaCO)  =  N/2 


Initial  Distribution:  HaCO)  =  N/4 


Macrostate 


Uniform  Initial  Distribution 


Macrostate 


FIG.  2.  Three  cases  of  the  initial  distribution  are  examined. 
The  exact  expression  for  the  macrostate  time  in  equation  (37) 
is  compared  with  averaged  results  from  a  Monte  Carlo  simu¬ 
lation  over  3000  runs  of  the  voter  model  when  N  =  100. 


know  that  do  = 


and  dk  =  0  otherwise.  Therefore, 


■‘2  ^_l_l 

the  time  spent  at  each  macrostate  is  also  uniform  with 
value 


2(7V-I-1) 


2.  Continuous  Time  Solution 

Here,  we  will  find  the  expected  time  spent  per 
macrostate  as  fV  — >■  oo  using  continuous  time  methods. 
While  the  discrete  solution  is  valid  for  every  N,  the  con¬ 
tinuous  time  solution  will  provide  some  additional  insight 
into  its  behavior  for  large  N.  Let  pj  =  j/N,  tm  =  m/N, 
and  u(pj,tm)  =  Using  this,  the  Fokker-Plank 

equation  for  the  model  is 

du  ,  ,  ,, 

Furthermore,  let  M{p)  be  the  expected  time  spent  at 
density  p  prior  to  consensus.  As  iV  — >  oo,  we  have  that 

OO  pQQ 

=  X!  /  u{p,t)dt.  (39) 

m=0  •^0 

Now,  integrate  equation  (38)  from  t  =  0  to  t  =  oo  to 
obtain 


u(p,  oo)  -  u(p,  0)  =  -p)ZU(p)]-  (40) 

Since  the  system  reaches  consensus  in  finite  time,  we  have 
that  u{p,oo)  =  0.  Let  f{p)  =  u{p,0)  and  T{p)  =  p(l  — 
p)M{p)/N.  Then,  the  problem  becomes 


-fip) 


d^T 

dp"^ 


(41) 


with  boundary  conditions  r(0)  =  T(l)  =  0.  The  solution 
of  this  problem  is  found  by  determining  the  Green’s  func¬ 
tion  of  the  differential  operator  [29].  For  this  problem, 
the  Green’s  function  is  given  by 


9{P,C) 


P(1  P  <  ? 

C(1  -P)  P>k 


(42) 


With  this  solution,  we  have  that 

np)=  f  fiOKp^m-  (43) 

Jo 

Therefore,  the  expression  for  macrostate  time  for  large 
N  is 


M{p)  ~  N 


f{C)9{Pii)d^- 


(44) 
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where 


5(p,e)=  I  i  •  (45) 

The  Green’s  function,  g{p,Oj  is  the  solution  to 
macrostate  times  when  the  initial  density  is  specified  as  a 
given  value  Furthermore,  when  the  initial  probability 
distribution  is  uniform,  the  solution  is  also  uniform  with 
value  M (p)  =  N/2  as  we  have  observed  with  the  discrete 
formulation. 


V.  THE  COMPLETE  BIPARTITE  GRAPH 


^  xvQ^^K  (47) 

With  this  simplihcation,  there  are  three  cases  need  to  be 
accounted  for: 

1.  yu^  XV  — 

2.  yu  XV  — 

3.  XV  <€.yu  — 5-  Qy^'^  <C 

Consider  the  first  case.  Since  we  can 

combine  the  two  terms  together  to  obtain 


The  methods  we  have  developed  for  the  complete  graph 
can  be  extended  to  more  complex  networks.  In  particu¬ 
lar,  we  will  consider  the  complete  bipartite  graph.  In  this 
case,  nodes  in  the  network  are  divided  into  two  groups. 
Every  node  in  a  group  is  connected  to  every  node  in  the 
other  group.  The  complete  bipartite  graph  can  also  be 
defined  as  the  complement  of  two  complete  graphs.  Let 
Ni  be  the  number  of  nodes  in  the  first  group  and  N2  be 
the  total  number  of  nodes  in  the  second  group.  Also,  let 
(m)  and  (m)  be  the  number  of  nodes  with  opinion 
A  in  groups  I  and  2  respectively. 

Letting  =  Pr{n^^\m)  =  i,  [m)  = 

j),  and  defining  a  power  series  Q^'^\x,y,u,v)  = 

J2i,j  ,  we  can  use  the  procedure  laid 

out  in  section  II  to  find  the  single  step  propagator  for  the 
model: 


t'jx  -  y) 
NN2 


NNo 


y{u  -  v) 

^yu 

NNi 

y)  x{u 

-u)1 

NNi  \ 

QM  =  A+„qW.  (46) 


Unlike  for  the  complete  graph,  the  exact  solution  to 
the  spectral  problem  for  this  equation  will  not  be  found 
with  these  methods.  However,  we  can  apply  some  as¬ 
sumptions  to  the  system  that  can  reduce  this  to  obtain 
an  approximation  of  the  size  of  the  spectrum.  In  partic¬ 
ular,  we  wish  to  find  the  approximate  size  of  the  spectral 
gap,  since  this  governs  the  expected  time  to  consensus. 
More  detailed  solutions  such  as  the  expected  time  spent 
per  macrostate  will  not  be  found. 

If  we  take  the  single  step  propagator  to  continuous 
time,  it  is  known  that  the  system  approaches  equilibrium 
when  n'^ /Ni  ~  n'^ /N2  [9].  Furthermore,  the  time  to 
reach  this  equilibrium  state  is  negligible  compared  to  the 
time  to  reach  consensus.  Along  this  line,  diffusion  gov¬ 
erns  the  motion  of  the  macrostate  of  the  system  instead 
of  drift.  The  study  of  the  behavior  of  the  probability  dis¬ 
tribution  is  most  valuable  when  drift  can  be  neglected. 
As  such,  we  will  make  this  assumption  in  equation  (46) 
to  obtain 


{U  -  V){X  -  y)  (ra) 

N1N2 


(48) 


Letting  G{x,y,u,v)  =  Cijx'^y^^  ■’,  the  spec¬ 

tral  problem  is  given  by 


{u  -  v){x  -  y)Gyu  ^  NiN2{X  -  l)G.  (49) 

Let  s  =  u  —  v,r  =  x  —  y,  and  i7(r,  y,  s,  v)  =  G{x,  y,  u,  v). 
As  with  the  complete  graph,  this  is  a  linear  trans¬ 
formation,  so  we  expect  solutions  of  the  same  form: 
H{r,y,s,v)  =  The  differential 

equation  becomes 


rs{Hsy-Hrs)^NiN2{X-l)H.  (50) 

The  corresponding  difference  equation  for  the  coeffi¬ 
cients,  bij,  is 


j{Ni  -  i  +  I)5i_ij  -  ijbij  «  fVifV2(A  -  l)bij.  (51) 

With  similar  arguments  for  the  complete  graph,  we  re¬ 
quire  a  singularity  in  this  difference  equation  so  that  the 
solution  is  not  trivial.  Therefore,  we  obtain  an  approxi¬ 
mate  spectrum  for  the  diffusive  scale  of  the  two  dimen¬ 
sional  random  walk: 


I  - 


N1N2' 


(52) 


Note  that  in  procedure,  the  same  spectrum  would 
be  recovered  if  one  were  to  use  Gxv  instead  of  Gyu  in 
equation  (49).  This  does  not  produce  the  spectrum  for 
the  complete  bipartite  graph,  but  rather  an  approximate 
form  when  the  walk  is  dominated  by  diffusion. 

Now  let  us  consider  cases  2  and  3  together.  Without 
loss  of  generality,  results  from  case  3  can  be  recovered 
by  case  2  through  interchanging  u  x,  y  v,  and 
Ni  ■n-  N2.  Physically,  this  corresponds  to  interchanging 
the  labels  on  the  two  groups  of  nodes.  Taking  case  2  as 
the  archetype,  drop  small  terms  to  simplify  the  propaga¬ 
tor  to 


/^(m)  I  /^(m)  _  (^(m) 

NNi^y^  7Vi7V2^^“ 


A+^gM.  (53) 


There  is  no  need  to  change  variables  to  solve  the  corre¬ 
sponding  spectral  problem  since  the  corresponding  finite 
difference  equation  explicitly  determines  cy.  Therefore, 
the  approximate  spectrum  for  this  case  is  found  to  be 


1  - 


2tj 

N1N2' 


(54) 


The  two  forms  of  the  approximate  spectrum  found  in 
equations  (52)  and  (54)  are  very  similar.  The  discrepancy 
between  the  two  is  due  to  the  errors  invoked  by  making 
the  necessary  approximations.  To  find  the  approximate 
eigenvalues,  we  required  that  the  sizes  of  both  sides  of 
the  spectral  problem  should  be  comparable.  Since  we 
used  scaling  arguments  to  find  the  approximate  spec¬ 
trum,  there  will  be  no  asymptotic  discrepancy  in  the 
spectral  gap  through  this  procedure.  These  results  in¬ 
dicate  that  the  spectral  gap  is  1  —  A2  =  O  ■ 

We  can  repeat  the  calculation  in  section  IV  to  find 
a  bound  on  consensus  times  for  the  complete  bipartite 
graph.  Following  this  procedure,  we  find  that  the  ex¬ 
pected  consensus  time  is 


E[T]  =  OiNiN2). 


(55) 


This  is  consistent  with  continuous  time  analysis,  which 
shows  that  the  expected  time  to  consensus  is  E[T]  = 

47Vi  V2  (1  —  w)  In  -I-  a;  In  i  ,  where  w  is  the  degree 

weighted  mean  of  microstates  [9].  As  before,  this  bound 
is  valid  for  all  initial  probability  distributions.  However, 


without  detailed  information  about  the  eigenvectors,  we 
cannot  extract  more  detailed  information  about  the  prop¬ 
agator  than  the  bound  on  consensus. 


VI.  CONCLUSIONS 


We  have  successfully  derived  exact  solutions  to  the 
voter  model  on  the  complete  graph.  In  particular,  the 
solution  to  the  spectral  problem  can  be  found  exactly, 
which  allows  us  to  find  the  time  m  probability  distribu¬ 
tion  of  the  model.  Also,  knowing  the  eigenvalues  for  the 
discrete  matrix  problem,  we  found  the  solution  to  the 
corresponding  differential  eigenvalue  problem.  The  solu¬ 
tions  of  which  are  found  to  be  hypergeometric  functions 
that  have  terminating  series  expressions.  Exact  formu¬ 
lae  for  the  expected  time  to  consensus  and  the  expected 
frequency  of  each  macrostate  prior  to  consensus  are  also 
given,  though  there  are  other  quantities  of  interest  that 
can  be  found  using  the  theory  given  above. 

The  means  by  which  these  exact  solutions  are  found 
also  provide  mathematical  insight.  Since  the  procedures 
are  easily  generalizable,  there  is  great  potential  for  ap¬ 
plying  these  techniques  to  other  problems.  This  includes 
extensions  of  the  voter  model,  imposing  it  upon  different 
graphs,  and  other  models  entirely.  There  are  other  urn 
models  of  interest  in  the  context  of  statistical  physics  and 
network  science  that  may  be  studied  using  these  tech¬ 
niques.  The  procedure  for  determining  the  single  step 
propagator  can  be  generalized  to  finding  probabilities  of 
microstates  instead  of  macrostates,  which  may  be  insight¬ 
ful  for  a  network  with  specific  topological  features.  With 
such  a  formulation,  a  general  network  can  be  studied  in 
detail.  Rigorous  treatment  of  these  claims  are  potential 
areas  of  future  work. 
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